System and method for providing switch based subnet management packet (smp) traffic protection in a middleware machine environment

ABSTRACT

A system and method can provide switch based subnet management packet (SMP) traffic protection in a middleware machine environment. The middleware machine environment includes a network switch that operates to receive at least one SMP destined for a subnet management agent (SMA). The network switch can check whether the at least one SMP includes a correct management key, and prevent the at least one SMP from being forwarded to the destined SMA when at least one SMP does not include the correct management key. Furthermore, the network switch can specify a different management key for each external port and can enforce separate restrictions on ingress and egress SMP traffic at a particular external port.

CLAIM OF PRIORITY

This application claims priority to U.S. Provisional Patent Application No. 61/506,557, entitled “SYSTEM AND METHOD FOR USING UNICAST AND MULTICAST FLOODING MECHANISMS TO PROVIDE EoIB GATEWAY vNICs” filed Jul. 11, 2011, and U.S. Provisional Patent Application No. 61/645,517, entitled “SYSTEM AND METHOD FOR PROVIDING SECRET MANAGEMENT KEY IN A MIDDLEWARE MACHINE ENVIRONMENT” filed May 10, 2012, which applications are herein incorporated by reference.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is related to the following patent application, which is hereby incorporated by reference in its entirety:

U.S. patent application Ser. No. ______, titled “SYSTEM AND METHOD FOR SUPPORTING SUBNET MANAGEMENT PACKET (SMP) FIREWALL RESTRICTIONS IN A MIDDLEWARE MACHINE ENVIRONMENT”, inventors Bjørn Dag Johnsen, Roy Arntsen and Lars Paul Huse, filed ______(Attorney Docket No.: ORACL-05398US0).

COPYRIGHT NOTICE

A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.

FIELD OF INVENTION

The present invention is generally related to computer systems and software such as middleware, and is particularly related to supporting a middleware machine environment.

BACKGROUND

The interconnection network plays a beneficial role in the next generation of super computers, clusters, and data centers. High performance network technology, such as the InfiniBand (IB) technology, is replacing proprietary or low-performance solutions in the high performance computing domain, where high bandwidth and low latency are the key requirements. For example, IB installations are used in supercomputers such as Los Alamos National Laboratory's Roadrunner, Tex. Advanced Computing Center's Ranger, and Forschungszcntrum Juelich's JuRoPa.

IB was first standardized in October 2000 as a merge of two older technologies called Future I/O and Next Generation I/O. Due to its low latency, high bandwidth, and efficient utilization of host-side processing resources, it has been gaining acceptance within the High Performance Computing (HPC) community as a solution to build large and scalable computer clusters. The de facto system software for IB is OpenFabrics Enterprise Distribution (OFED), which is developed by dedicated professionals and maintained by the OpenFabrics Alliance. OFED is open source and is available for both GNU/Linux and Microsoft Windows.

SUMMARY

Described herein are systems and methods for providing switch based subnet management packet (SMP) traffic protection in a middleware machine environment. The middleware machine environment includes a network switch that operates to receive at least one SMP destined for a subnet management agent (SMA) components. The network switch can check whether the at least one SMP includes a correct management key, and prevent the at least one SMP from being forwarded to the destined SMA when at least one SMP does not include the correct management key. Furthermore, the network switch can specify a different management key for each external port and can enforce separate restrictions on ingress and egress SMP traffics at a particular external port.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 shows an illustration of supporting a management key protection model in a middleware machine platform, in accordance with an embodiment of the invention.

FIG. 2 shows an illustration of providing switch based SMP traffic protection in a middleware machine environment, in accordance with an embodiment of the invention.

FIG. 3 illustrates an exemplary flow chart for providing switch based SMP traffic protection in a middleware machine environment, in accordance with an embodiment of the invention.

DETAILED DESCRIPTION

Described herein is a system and method for providing a middleware machine or similar platform. In accordance with an embodiment of the invention, the system comprises a combination of high performance hardware, e.g. 64-bit processor technology, high performance large memory, and redundant InfiniBand and Ethernet networking, together with an application server or middleware environment, such as WebLogic Suite, to provide a complete Java EE application server complex which includes a massively parallel in-memory grid, that can be provisioned quickly, and can scale on demand. In accordance with an embodiment, the system can be deployed as a full, half, or quarter rack, or other configuration, that provides an application server grid, storage area network, and InfiniBand (IB) network. The middleware machine software can provide application server, middleware and other functionality such as, for example, WebLogic Server, JRockit or Hotspot JVM, Oracle Linux or Solaris, and Oracle VM. In accordance with an embodiment, the system can include a plurality of compute nodes, IB switch gateway, and storage nodes or units, communicating with one another via an IB network. When implemented as a rack configuration, unused portions of the rack can be left empty or occupied by fillers.

In accordance with an embodiment of the invention, referred to herein as “Sun Oracle Exalogic” or “Exalogic”, the system is an easy-to-deploy solution for hosting middleware or application server software, such as the Oracle Middleware SW suite, or Weblogic. As described herein, in accordance with an embodiment the system is a “grid in a box” that comprises one or more servers, storage units, an IB fabric for storage networking, and all the other components required to host a middleware application. Significant performance can be delivered for all types of middleware applications by leveraging a massively parallel grid architecture using, e.g. Real Application Clusters and Exalogic Open storage. The system delivers improved performance with linear I/O scalability, is simple to use and manage, and delivers mission-critical availability and reliability.

M_Key Protection Model

FIG. 1 shows an illustration of supporting a management key protection model in a middleware machine platform, in accordance with an embodiment of the invention. As shown in FIG. 1, a management key, such as an M_Key 102, can be used to protect an IB fabric (or an IB subnet) 100. The values for the M_Key 102 may only be known by fabric administrators 110, which can have administrator access to the switches 103-104 and the designated subnet manager (SM) nodes 101 in the I B subnet/fabric 100. The integrity of the M_Key(s) 102 depends on the integrity of the fabric level administration passwords used by the fabric administrators 110, as well as the physical access protection of the switches 103-104 in the IB subnet/fabric 100, e.g. in a data center.

In the IB fabric 100, a secure HCA firmware implementation in HCA 121-124 can keep the type and identity of various fabric nodes well defined. Each of the HCA121-124 can implement a subnet management agent (SMA) components131-134, each of which can be associated with an M_Key 141-144. Furthermore, the connected switches A-B 103-104 can be controlled by the fabric administrator 110. Thus, any rogue SMA implementation 131-134 may not compromise the fabric administrator 110 defined M_Key 102 values that are used in the IB subnet/fabric 100.

Additional descriptions of various embodiments of using secure HCA firmware implementation in a middleware machine platform are provided in U.S. patent application Ser. No. 13/487,973, entitled “SYSTEM AND METHOD FOR PROVIDING SECURE SUBNET MANAGEMENT AGENT (SMA) IN AN INFINIBAND (IB) NETWORK”, filed Jun. 4, 2012, which application is herein incorporated by reference.

Furthermore, the fabric administrator 110 can ensure that new M_Key values 102 for the IB subnet/fabric 100 are installed out-of-band on switches 103-104 (as well as for the relevant subnet manager instances 101). Additionally, the fabric administrator 110 can ensure that there is infinite M_Key 102 lease time on the switches 103-104. Thus, the host based software 161-164, e.g. a host based subnet manager on different hosts 111-114 (including an operating system 151-154), can not hijack the control of any switch 103-104 in the IB subnet/fabric 100.

In accordance with an embodiment of the invention, a single M_Key 102 value (or a single set of M_Key values) can be used for various nodes in the in the IB subnet/fabric 100 based on the IB specification defined access restrictions. The correct value for a current M_Key 102 may need to be specified before either reading or updating the M_Key 102, since the secure HCA firmware can ensure that the “read protected” M_Key assigned to the local HCA 121-124 is not exposed to local host based software.

Additionally, local software 161-164 on different hosts 111-114 may be able to hi-jack the HCA port by setting up its own M_Key value, in the case when the current M_Key value for HCA ports is defined at run-time. Also, the host local software 161-164 may make the HCA port unmanageable for the designated subnet manager 101, e.g., before the designated subnet manager 101 sets up any M_Key 102 for the HCA 121-124.

In accordance with an embodiment of the invention, a designated subnet manager 101 can ignore any HCA ports with un-known M_Key value and leave the corresponding link not initialized. The only impact of a hijacked HCA port M_Key can be that the HCA port may not be operational, and the designated subnet manager 101 can prevent host based software from communicating via this HCA port using normal communication, i.e. non-SMP/VL15 based communication.

Furthermore, when host software 111-114 compromises the local HCA M_Key value, the offending host software may be able to bring the HCA port to an operational state with activated local identifiers (LIDs) and partition membership. In such a case, if the switch port on a switch 103-104 that connects to the HCA 121-124 is controlled by a different M_Key value that is not known to the host software 111-114 that has compromised the local HCA M_Key value, then the offending host software 111-114 may not be able to bring the link to a full operational state that allows normal data traffic.

In accordance with an embodiment of the invention, the IB fabric 100 can prevent direct route SMPs between the various hosts 111-114 in order to avoid various potentially threatening scenarios. In one scenario, a host, e.g. host 111, can use the direct route SMPs to hijack the M_Key of the HCA port on a remote host, e.g. 112, after the remote host 112 and/or the remote HCA 122 are reset. This can cause the remote HCA 122 port to become inaccessible from the SM 101 and thereby prevent the remote host 112 from participating in normal IB communication, i.e. a denial of service (DoS) attack. In another scenario, when two hosts, e.g. host 111 and host 114, are compromised by hackers, the cooperating administration in the IB fabric 100 that depends on direct route SMPs may allow the two compromised hosts to exchange information using direct route SMPs.

The IB fabric 100 can support the cooperating administration for exchanging information between different hosts 111-114 without depending on direct route SMPs. For example, the administrators for the hosts can access a shared web-page on the Internet instead of relying on direct route SMPs in the IB fabric 100. From a fabric security perspective, leaving direct route SMPs as a security hole on the IB fabric may be considered a worse situation than allowing both host administrators to access a shared web-page on the Internet.

In accordance with an embodiment of the invention, the HCA ports may be set up with finite lease time on M_Keys 102, e.g. due to a high availability concern with the subnet manager(s) 101 that maintains the M_Key 102 lease period. Thus, the M_Keys 102 can expire without the associated link going down. Consequently, the state of the HCA 121-124, e.g. the partition membership, may be updated while links are still in active mode and the LID routes for the involved port are still operational. Then, the IB fabric 100 without M_Key protection may mistakenly allow normal IB traffic between a hi-jacked host and the hosts in other partitions.

Furthermore, if the M_Keys 102 expire before the links going down, both the local HCA, e.g. HCA 121, and any remote HCA, e.g. HCA 124, may be hi-jacked and the partition membership may be modified. If the associated switch ports, e.g. on switches 103-104, are not set up to perform partition enforcement, then the traffic with non-solicited partition membership can reach any other node in the fabric.

Additionally, a subnet manager 101 within the IB fabric 100 can depend on a designated virtual lane (VL), e.g. the VL15 buffering, in order to correctly monitor and control the IB fabric 100 and negotiate with other subnet managers in the IB fabric 100. Since the VL15 buffering within the IB fabric 100 is a shared resource, the uncontrolled use of SMPs from any host can represent a DoS attack. This can affect subnet manager 101 operations, since the M_Key protection within the IB fabric 100 may prevent the hosts from changing any SMA state on any node. Thus, there is a need to provide SMP traffic protection in the IB fabric 100.

In accordance with an embodiment of the invention, the M_Key 102 can be created and managed by fabric administrators 110 and stored in secured memory on switches A-B 103-104 and/or HCAs121-124. A microprocessor on a switch A-B 103-104 or a HCA121-124 can access the memory for reading out the M_Key 102 or writing the M_Key 102 into the memory.

Switch Based Proxy M_Key Protection

In accordance with an embodiment of the invention, M_Key checking of SMPs can be performed in intermediate switch nodes in an IB fabric to ensure that the local switch M_Key setting can protect remote HCA ports that do not have an M_Key set up.

FIG. 2 shows an illustration of providing switch based SMP traffic protection in a middleware machine environment, in accordance with an embodiment of the invention. As shown in FIG. 2, a middleware machine environment 200 can comprise an IB switch 201 that connects to a host 203 via a HCA 202. The host 203 can include a host stack software 205 running on top of an operating system 207. The IB switch 201 can include one or more switch ports 211-216, each of which can be used for connecting with separate nodes or entities in the IB fabric 200, e.g. switch port 211 is connected with the HCA 202.

A subnet management agent (SMA) component 206, which is implemented in firmware 204 on the HCA 202, can communicate with other nodes in the IB fabric 200 via the switch port 211. Also, the designated subnet manager 208 in the IB fabric 200 can use a particular switch port, e.g. port 211, for both sending direct route SMP requests to any SMA 206 and receiving direct route SMP responses from the SMA 206.

The switch 201 can prevent un-intended SMP traffic occurring in the IB fabric 200, without depending on requiring all HCAs to have trusted firmware with SMP control. For example, the switch 201 can filter direct route SMP traffic that is not consistent with a fabric policy for the IB fabric 200.

The switch 201 can use a filtering scheme to prevent a remote HCA port 202 from being hijacked by an intruder. The filtering scheme can be based on identifying any direct route SMP request that targets setting an SMA 206 attribute. Additionally, the filtering scheme can perform the same M_Key check for all direct route SMP requests independently of the destination, and may require that the direct route SMP requests include the correct M_Key 209 for the local switch 201 independently of which destination the SMP is targeting.

A single M_Key 209 can be used in an IB fabric 200, which includes the switch 201 and the HCA ports 202 that directly connects to the switch 201. If the host stack software 205 is able to compromise the M_Key 219 that protects the local HCA 202, then the host stack software 205 may also compromise the M_Key 209 protecting the local switch 201, since all SMP traffic in the IB fabric 200 includes the local switch M_Key 209.

In accordance with an embodiment of the invention, the switch 201 implementation can specify an optional M_Key value for each external port, e.g. M_Keys 221-226 for external ports 211-216. Additionally, the switch 201 implementation can ensure that any SMP sent out from a switch port, e.g. switch port 211, and any SMP received from this port all have an M_Key value that matches the M_Key 221 specified for the switch port 211. Furthermore, the network switch 201 can enforce separate restrictions on SMPs 220 sending from an external port 214 and SMPs 210 receiving at the external port 214.

Using this mechanism, a legitimate subnet manager 208 can ensure that all potentially untrusted remote HCA ports or other potentially untrusted remote ports may only be allowed to send out SMPs, when the correct local M_Key associated with a switch port is specified. Also, attempts to access a remote port may need to have the subnet manager 208 defined M_Key for that port independently of whether the remote port M_Key has been hi-jacked or not. Additionally, this mechanism can also specify an SMP rate that defines how fast the remote port can generate SMP, in order to prevent or reduce the chance of SMP based DoS attacks from an untrusted port in the fabric.

Additional descriptions of various embodiments of using SMP filtering in a middleware machine platform are provided in U.S. Pat. No. 7,398,394, entitled “SYSTEM AND METHOD FOR AUTHENTICATING NODES IN A COMMUNICATION NETWORK”, filed Jun. 2, 2004 and issued Jul. 8, 2008, which application is herein incorporated by reference.

In accordance with an embodiment of the invention, a filtering scheme can prevent illegal host-host based direct route SMP traffic based on declaring the switch ports as either “trusted” or “not trusted.” The determination of whether a switch port, or the entity connecting to the switch port, is trusted or not can be based on either explicit policy input to the local switch 201, or automatic authentication of the remote port.

As shown in FIG. 2, the switch ports 211, 213-214, and 216 are trusted switch ports, while the switch ports 212 and 215 are not trusted. The filtering scheme may only allow SMP requests to be sent from the trusted ports (egress from the switch), and SMP responses to be received by the trusted ports (ingress to the switch). Additionally, only SMA requests, which have entered the subnet from a trusted port in the first place, may be allowed for egress to un-trusted ports. Thus, the filtering scheme can be made independent of any current M_Key setting, and can also be used together with the above M_Key based filtering scheme.

In accordance with an embodiment of the invention, the ability to use a single M_Key 209, or a single set of M_Keys, throughout an IB subnet/fabric 200 depends on whether all nodes in the IB fabric 200 are trusted and do not expose the M_Key in use to any entity that does not have the required privileges. A pre-requisite for a subnet manager 208 to include the current M_Key in a request is that the subnet manager 208 can be assured that the target and any intermediate agent may not compromise the integrity of the M_Key. In one example, such trustfulness can be established prior to including the current M_Key in any SMP. Thus, the fabric configurations may require all nodes in the IB fabric, including the HCAs, to be authenticated (or declared) as trusted before any M_Key based communication can takes place, instead of assuming that all SMA instances are trustful a priori.

In accordance with an embodiment of the invention, a mechanism for sending and receiving vendor based SMPs via the switch management interface can be provided. Such a mechanism enables a switch embedded authentication mechanism to operate as part of the switch local software and thereby operate in concert with embedded subnet managers and the embedded switch driver and SMA stack.

FIG. 3 illustrates an exemplary flow chart for providing switch based SMP traffic protection in a middleware machine environment, in accordance with an embodiment of the invention. As shown in FIG. 3, at step 301, a network switch can receive one or more SMPs destined for a subnet management agent (SMA). Then, at step 302, the network switch can check whether the one or more SMPs include a correct management key. Additionally, at step 303, the network switch can prevent the one or more SMPs from being forwarded to the destined SMA when the one or more SMPs do not include the correct management key.

The present invention may be conveniently implemented using one or more conventional general purpose or specialized digital computer, computing device, machine, or microprocessor, including one or more processors, memory and/or computer readable storage media programmed according to the teachings of the present disclosure. Appropriate software coding can readily be prepared by skilled programmers based on the teachings of the present disclosure, as will be apparent to those skilled in the software art.

In some embodiments, the present invention includes a computer program product which is a storage medium or computer readable medium (media) having instructions stored thereon/in which can be used to program a computer to perform any of the processes of the present invention. The storage medium can include, but is not limited to, any type of disk including floppy disks, optical discs, DVD, CD-ROMs, microdrive, and magneto-optical disks, ROMs, RAMs, EPROMs, EEPROMs, DRAMs, VRAMs, flash memory devices, magnetic or optical cards, nanosystems (including molecular memory ICs), or any type of media or device suitable for storing instructions and/or data.

The foregoing description of the present invention has been provided for the purposes of illustration and description. It is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many modifications and variations will be apparent to the practitioner skilled in the art. The embodiments were chosen and described in order to best explain the principles of the invention and its practical application, thereby enabling others skilled in the art to understand the invention for various embodiments and with various modifications that are suited to the particular use contemplated. It is intended that the scope of the invention be defined by the following claims and their equivalence. 

1. A method for providing switch based subnet management packet (SMP) traffic protection in a middleware machine environment operable on one or more microprocessors, comprising: receiving, via a network switch, one or more SMPs destined for a subnet management agent (SMA); checking, via the network switch, whether the one or more SMPs include a correct management key; and preventing, via the network switch, the one or more SMPs from being forwarded to the destined SMA when the one or more SMPs do not include the correct management key.
 2. The method according to claim 1, further comprising: filtering the one or more SMPs according to a fabric policy.
 3. The method according to claim 2, further comprising: allowing each of the one or more SMPs to be a direct route SMP.
 4. The method according to claim 2, further comprising: allowing a subnet manager to use one or more SMPs to communicate with a subnet management agent (SMA) via a particular switch port on the network switch.
 5. The method according to claim 1, further comprising: defining a different managment key for each external port on the network switch .
 6. The method according to claim 5, further comprising: allowing the network switch to enforce separate restrictions on SMPs sent from an external port to the SMA and SMPs received at the external port from the SMA.
 7. The method according to claim 1, further comprising: ensuring, via a subnet manager, that a untrusted remote host channel adapter (HCA) associated with the SMA can only send out one or more SMPs when a correct management key is specified.
 8. The method according to claim 1, further comprising: specifying an SMP rate that defines how fast a remote HCA port is allowed to generate SMPs.
 9. The method according to claim 1, further comprising: declaring one or mroe switch ports to be trusted and allowing SMP requests to be sent from the trusted ports, and SMP responses to be received at the trusted ports.
 10. The method according to claim 9, further comprising: allowing only SMA requests to be sent to an untrusted port from the trusted ports.
 11. A system for providing switch based subnet management packet traffic protection in a middleware machine environment, comprising: a subnet management agent (SMA) component; a network switch running on one or more microprocessors, wherein the network switch operates to receive one or more SMPs destined for the SMA component; check whether the one or more SMPs include a correct management key; and prevent the one or more SMPs from being forwarded to the destined SMA when at least one SMP does not include the correct management key.
 12. The system according to claim 11, wherein: the one or more SMPs are filtered according to a fabric policy.
 13. The system according to claim 12, wherein: each of the one or more SMPs is a direct route SMP.
 14. The system according to claim 12, wherein: a subnet manager can use one or more SMPs to communicate with a subnet management agent (SMA) via a particular switch port on the network switch.
 15. The system according to claim 11, wherein: a different managment key is defined for each external port on the network switch.
 16. The system according to claim 15, wherein: the network switch can enforce separate restrictions on SMPs sent from an external port to the SMA and SMPs receivied at the external port from the SMA.
 17. The system according to claim 11, further comprising: a subnet manager, which ensures that a untrusted remote host channel adapter (HCA) associated with the SMA can only send out a SMP when a correct management key is specified.
 18. The system according to claim 11, wherein: the network switch use an SMP rate to define how fast a remote HCA port is allowed to generate SMPs.
 19. The system according to claim 11, wherein: one or more trusted switch ports on the network switch allows SMP requests to be sent from the trusted ports, and SMP responses to be received at the trusted ports.
 20. A non-transitory machine readable storage medium having instructions stored thereon for providing switch based subnet management packet (SMP) traffic protection in a middleware machine environment that when executed cause a system to perform the steps of: receiving, via a network switch, one or more SMPs destined for a subnet management agent (SMA); checking, via the network switch, whether the one or more SMPs include a correct management key; and preventing, via the network switch, the one or more SMPs from being forwarded to the destined SMA when the one or more SMPs do not include the correct management key. 